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ABSTRACT 

The RNA helicase Upf1 is a multifaceted eukaryotic 
enzyme involved in DNA replication, telomere me- 
tabolism and several mRNA degradation pathways. 
Upf1 plays a central role in nonsense-mediated 
mRNA decay (NMD), a surveillance process in 
which it links premature translation termination to 
mRNA degradation with its conserved partners 
Upf2 and Upf3. In human, both the ATP-dependent 
RNA helicase activity and the phosphorylation of 
Upf1 are essential for NMD. Upf1 activation occurs 
when Upf2 binds its N-terminal domain, switching 
the enzyme to the active form. Here, we uncovered 
that the C-terminal domain of Upf1, conserved in 
higher eukaryotes and containing several essential 
phosphorylation sites, also inhibits the flanking 
helicase domain. With different biochemical 
approaches we show that this domain, named SQ, 
directly interacts with the helicase domain to 
impede ATP hydrolysis and RNA unwinding. The 
phosphorylation sites in the distal half of the SQ 
domain are not directly involved in this inhibition. 
Therefore, in the absence of multiple binding 
partners, Upf1 is securely maintained in an inactive 
state by two intramolecular inhibition mechanisms. 
This study underlines the tight and intricate regula- 
tion pathways required to activate multifunctional 
RNA helicases like Upf1. 

INTRODUCTION 

RNA helicases are highly conserved enzymes that use 
NTPs to drive the dissociation of RNA duplexes or the 
remodelling of RNA-protein complexes (1,2). Most RNA 
helicases are monomeric, non-toroidal enzymes, which fall 
into two distinct superfamilies (SF1 and SF2) on the basis 
of sequence conservation (3,4). These enzymes are involved 
in nearly all aspects of RNA metabolism, acting on many 



distinct substrates and being associated with a wide range 
of functions. The basic structural fold of the helicase core is 
highly conserved in both superfamilies. It is composed of 
two RecA-like tandem domains facing each other and 
forming a cleft for nucleotide binding. Despite this struc- 
tural homology, each RNA helicase has very specific 
physiological targets and functions (4). Although the 
exact mechanisms that confer this specificity remain 
unknown in most cases, they depend presumably on the 
presence of accessory domain insertions and/or N- and 
C-terminal domain additions to the helicase core (2,5). 
Such molecular appendages can allow the regulation of 
helicase activity by mediating the binding of molecular 
partners or the formation of multimeric complexes (6). 
One important challenge is to determine how these 
helicases are regulated to achieve specific functions. 

Upfl (for up-frameshift) is a SF1 RNA helicase that is 
essential for several cellular processes including 
Staufen-1 -mediated mRNA decay (SMD) (7), histone 
mRNA degradation (8), DNA replication (9) and 
telomere metabolism (10). To date, the best-documented 
function of Upfl concerns its role in nonsense-mediated 
mRNA Decay (NMD). NMD is a quality control process 
that identifies and degrades mRNAs harbouring a prema- 
ture termination codon (PTC). This 'surveillance' mech- 
anism limits the synthesis of truncated proteins potentially 
harmful for the cell. PTCs arise from DNA mutations or 
errors that occur during mRNA synthesis and processing 
(1 1). About one-third of human genetic diseases and many 
types of cancers are associated with mutations that 
produce a PTC (12,13). NMD also participates in the 
regulation of gene expression by targeting a significant 
fraction (5-10%) of physiological mRNAs (14-16). Upfl 
is essential for NMD in all eukaryotes even though the 
signals that define a premature stop codon differ between 
organisms (17). One pathway that links PTC recognition 
by the translation machinery to Upfl action has been well 
described in mammals (18). Briefly, mammalian mRNAs 
are associated with exon-junction complexes (EJC) that 
are deposited as a consequence of pre-mRNA splicing 
upstream of exon-junctions (19,20). EJCs accompany 
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mRNAs from the nucleus to the cytoplasm where NMD 
factors Upf3 and Upf2 are recruited. During the first 
round of translation, the PTC induces the recruitment of 
the SMGl-Upfl-eRFl-eRF3 (SURF) complex, including 
translation termination factors eRFl-3, Upfl and its 
kinase SMG1 (21-23). If an EJC is present downstream 
of the stalled ribosome, the SURF complex interacts with 
EJC to form the decay-inducing complex [DECID; (23)]. 
The formation of this surveillance complex triggers Upfl 
phosphorylation by SMG1 (23-25). Both the phosphoryl- 
ation state and enzymatic activity of Upfl are essential to 
commit definitively the targeted mRNA to degradation 
(26,27). 

Recent biochemical and structural studies have begun 
to explain how the activity of Upfl is regulated by its 
NMD-binding partners. Upfl possesses a conserved 
helicase core able to unwind double-stranded nucleic 
acids in a 5'- to 3'-direction (21,28,29). The helicase 
core of the human protein contains two internal inser- 
tions and is surrounded by two terminal domains. The 
N-terminal domain is rich in cysteines and histidines 
(CH domain) and the C-terminal region is rich in 
serine-glutamine clusters (SQ domain). The role of the 
CH domain, which exerts a cis inhibitory action on 
both the ATPase and unwinding activities of the Upfl 
helicase core (29), has been recently elucidated. The 
domain forces Upfl to bind RNA in a clamping con- 
formation that prevents its function. Inhibition can be 
relieved by binding of the Upf2 factor to the CH 
domain. This interaction triggers the displacement of 
the CH domain from its original position, thereby 
promoting a conformational change and the activation 
of the helicase core (30). At present, the molecular 
function of the SQ domain is much less understood as 
most biochemical studies have been performed with Upfl 
constructs truncated at their C-terminal domain. 

Comparison of Upfl sequences from eukaryotes shows 
that the SQ domain is much less conserved than the CH 
domain (31,32). The SQ domain is notably absent in lower 
eukaryotic organisms such as yeasts. However, the align- 
ment of SQ domain sequences from 12 distantly related 
vertebrate genomes reveals a high degree of identity over 
the whole domain (Supplementary Figure SI), suggesting 
functional importance of the SQ domain in higher eukary- 
otes. Consistent with this, the SQ domain of human Upfl 
contains several phosphorylation sites that are essential 
for its activity in vivo and its interaction with cofactors 
(23-25). At present, direct structural information is 
lacking for the SQ domain. Secondary structure prediction 
programs identify only small structured regions within a 
largely unstructured fold (Supplementary Figure SI). 

Here, we have used a combination of biochemical 
assays to investigate the role of the SQ domain in the 
activity of human Upfl. We found that the SQ domain 
is able to lock the Upfl helicase core in a conformation 
that is not productive for ATP hydrolysis or duplex 
unwinding. This inhibitory activity is mediated by direct 
binding of the SQ domain to the helicase core and is not 
directly affected by SQ phosphorylation. Moreover, 
SQ-mediated inhibition does not require the CH repressor 
domain and is not subject to direct regulation by Upf2 or 



Upf3 cofactors. Altogether our data unveil a new 
auto-inhibitory mechanism of human Upfl and reveal a 
complex regulatory network whereby distinct intramo- 
lecular switches independently govern Upfl activity. 
This intricate regulation is, to the best of our knowledge, 
unprecedented for RNA helicases and most likely reflects 
the necessity for the tight enzymatic control of central, 
multifunctional enzymes such as human Upfl. 



MATERIALS AND METHODS 

An exhaustive description of the following methods can be 
found in Fiorini et al. (33). 

Purification of Upfl proteins 

The Upfl expression vectors were prepared as previously 
described (29,33). The human proteins were expressed in 
Escherichia coli BL21 (DE3) Rosetta cells (Novagen) 
grown in LB medium and induced overnight at 16°C. 
The cells were lysed in buffer A [1.5x PBS pH 7.5, 
225 mM NaCl, ImM magnesium acetate, 0.1% (w/v) 
NP-40, 20 mM imidazole, 10% (w/v) glycerol] supple- 
mented with 100|ig/ml of egg white lysozyme 
(Sigma- Aldrich) and with lx protease inhibitor cocktail 
EDTA-Free (Sigma- Aldrich). Soluble lysate was applied 
to a pre-packed nickel column (HisTrap FF crude, GE 
Healthcare) and fractioned on an Akta Purifier (GE 
Healthcare) using a linear gradient from buffer A to B 
(buffer A added of 0.5 M imidazole) over 20 column 
volumes. The Upfl proteins fused to a hexahistidine tag 
were subsequently purified on a heparin column (HiTrap 
heparin HP, GE Healthcare). Proteins that contained a 
calmodulin-binding peptide (CBP) were further purified 
on a calmodulin affinity column. Loading of the samples 
and column washes were performed with a buffer 
composed of lx PBS pH 7.5, 150mM NaCl, 1 mM mag- 
nesium acetate, 0.1% (w/v) NP-40, ImM DTT, 4mM 
calcium chloride and 10% (w/v) glycerol. Sample elution 
from the column was performed with the same buffer con- 
taining 20 mM EGTA instead of calcium chloride. To 
purify the Upfl-HD-Upfl-SQ complex, the two proteins 
(Upfl-HD-His and CBP-Upfl-SQ-His) were co-purified 
after co-lysis of bacterial pellets obtained from two 
separate bacterial cultures. Ly sates were treated with 
RNase A (50ug/ml) for 2h at 4°C before column purifi- 
cation of the complex. Alternatively, a Upfl-HD- 
Upfl-SQ complex was also obtained by co-expression of 
the two proteins (CBP-Upfl-HD and Upfl-SQ-His) 
expressed from the same plasmid (34) before 
co-purification onto Nickel and Calmodulin resins. The 
Upfl-CH/HD K498A and Upfl-HD-SQ S1078E/S1096E 
mutants were engineered with the QuikChange kit 
(Stratagene) and purified using the above protocol. 

Duplex unwinding assay 

For this assay, we used a 21-nt RNA-DNA hybrid con- 
taining a 12-nt long 5'-ssRNA overhang (Figure IB). 
Transcription of the RNA, radiolabelling of the DNA 
oligonucleotide, purification of the RNA-DNA duplex 
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and single-run helicase reactions were carried out as in 
Fiorini et al. (33) with minor modifications. Briefly, the 
32 P-labelled RNA-DNA hybrid substrate (1 nM, final 
concentration) was mixed with an excess of Upf 1 protein 
(50 nM) in helicase buffer (20 mM MES, pH 6.0, 100 mM 
potassium acetate, 1 mM DTT, 0.1 mM EDTA) and 
incubated for 5min at 37°C. The reaction was initiated 
by adding a mixture containing ATP and MgCl 2 (1 mM, 
final concentrations), an excess of cold DNA strand 
(0.3 uM; to trap released RNA strands) and heparin 
(1 mg/ml; to trap free/released Upfl molecules). In prelim- 
inary experiments, we observed that lower concentrations 
of heparin were not sufficient to establish the pseudo-first- 
order kinetic regimen characteristic of single-run helicase 
conditions (35). However, heparin interferes with Upfl- 
Upf2 interaction, so that all the unwinding reactions con- 
taining Upf2 were carried out without heparin. Reaction 
aliquots were withdrawn at various times, quenched with 
150mM sodium acetate, 10 mM EDTA, 0.5% (w/v) SDS, 
25% (w/v) Ficoll-400, 0.05% (w/v) xylene cyanol, 0.05% 
(w/v) bromophenol blue and analysed by polyacrylamide 
gel electrophoresis and phosphorimaging on a Typhoon- 
Trio (GE Healthcare), as described (30,33,36). 

RNA and protein co-precipitation 

The RNA co-precipitation and CBP pull-down were per- 
formed as described previously (29,33,34). Briefly, in RNA 
co-precipitation assay, proteins or preformed protein 
complex (2ug) were mixed with 70pmol of 3 / -end 
biotinylated ssRNA (30 nt) in binding buffer [20 mM 
HEPES pH 7.5, 150mM potassium acetate, 2mM magne- 
sium acetate, 1 mM DTT, 6.3% (v/v) glycerol and 0.1% 
(w/v) NP-40]. The reactions were supplemented (or not) 
with 3.3 mM ADPNP or ADP in a final volume of 30 ul 
and incubated for 20min at 30°C. Then, 5 ul of pre-coated 
streptavidin-coupled magnetic beads (Dynabeads, Life 
technologies) were added before further incubation for 
1 h at 4°C. Unless indicated otherwise, the beads were 
washed with a buffer containing 200 mM potassium 
acetate. Proteins were eluted by adding 7.5 ul of SDS 
loading buffer directly to the beads. The various fractions 
were subsequently analysed by 13.5% SDS-PAGE. For 
precipitation of protein complexes by CBP-bait protein, 
the magnetic beads were replaced with 12 ul of pre-coated 
calmodulin resin (50% slurry, Stratagene). The resin was 
washed three times with 500 ul BB-200 (200 mM potas- 
sium acetate) and eluted with 20 mM EGTA in 20 ml 
BB-150. Eluates were separated on 10% SDS-PAGE 
and visualized by comassie staining. 

ATP binding 

For ATP binding, we essentially followed the protocol 
described in Cheng et al. (37). Briefly, hUpfl proteins 
(2ug) were spotted on nitrocellulose membrane. The 
membrane was soaked in blocking buffer [20 mM 
HEPES pH 7.0, 50 mM K Acetate, 2.5 mM Mg Acetate, 
2mM DTT, 3% BSA (w/v), 10% (v/v) glycerol] and 
incubated on a rocking platform for 1 h at room tempera- 
ture. Then, the blocking buffer was replaced by binding 
buffer [20 mM HEPES pH 7.0, 50 mM K acetate, 2.5 mM 



Mg acetate, 2mM DTT, 1.5% (w/v) BSA, 10% (v/v) gly- 
cerol] supplemented with 30 uGi of [oc 32 P]-ATP 
(800 Ci/mmol; Perkin Elmer) or 30 u€i of [y 32 P]-ATP 
(lOmCi/ml, 3000Ci/mmol; Perkin Elmer) before further 
incubation for 20min at room temperature. The 
membrane was washed twice with blocking buffer before 
being dried and analysed by phosphorimaging. 

ATP hydrolysis 

For steady-state experiments, Upfl (5pmol) was 
incubated at 30° C in a 10- ul reaction mixture containing 
lx ATPase buffer [20 mM MES pH 6.0, 100 mM potas- 
sium acetate, 1 mM DTT, 0.1 mM EDTA, ImM magne- 
sium acetate, 1 uM zinc sulphate and 5% (v/v) glycerol], 
2 uCi of [oc 32 P]-ATP (800 Ci/mmol, Perkin Elmer), 25 uM 
ATP and 0.5 mM polyU (concentration in rU residues). 
The single-cycle ATPase experiments were performed by 
incubating 7.5pmol of Upfl with 5nM of [oc 32 P]-ATP 
(800 Ci/mmol) and 50 uM of polyU in 15ul of lx 
ATPase buffer at 30°C. Control experiments were also 
performed in absence of polyU in the reaction. To 
account for potential RNA contamination of the protein 
preparations, samples deprived of polyU were first 
pre-incubated with 500 U of RNaseTl (Thermo scientific), 
5 jig of RNaseA (Thermo scientific) and 50 U of RNasel 
(Ambion) for lOmin at 30°C before addition of [oc 32 P]- 
ATP. Single-cycle ATPase samples containing polyU 
were treated under the same conditions but without 
RNases. Reaction aliquots (2ul) were withdrawn at 
various times and quenched with a buffer (5 ul) containing 
10 mM EDTA and 0.5% (v/v) SDS. Samples were 
analysed by phosphorimaging after thin layer chromatog- 
raphy on polyethyleneimine cellulose plates (Merck) with 
0.35 M potassium phosphate (pH 7.5) as elution buffer. 

Electrophoretic mobility shift assay 

Samples were prepared by mixing a radiolabeled 30-mer 
oligoribonucleotide (1 nM) with Upfl protein (0, 0.5, 1, 2, 
5, 10, 20, 30 or 50 nM) and with or without ADPNP 
(3.5 mM) in a buffer containing 20 mM MES pH 6.0, 
lOOmM potassium acetate, 2mM DTT, 0.2ug/ul of 
BSA and 6% (v/v) glycerol. The samples were incubated 
at 30°C for 20min before being resolved by native 6.5% 
polyacrylamide (19:1) gel electrophoresis and analysed by 
phosphorimaging. 

RESULTS 

Production of recombinant Upfl proteins 

To determine the function of the SQ domain, we expressed 
and purified different recombinant versions of human 
Upfl (Figure 1A). The boundaries between the 
N-terminal CH domain, the central helicase domain 
(HD) and the C-terminal SQ domain were defined accord- 
ing to previous structural studies (30,38-40). We produced 
the full-length protein [Upfl-FL; amino acids (aa) 
1-1118], the helicase core domain (Upfl-HD; aa 
295-914), a protein containing both the CH and HD 
domains (Upfl-CH/HD; aa 115-914), and a protein 
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containing both the HD and the SQ domain (Upfl-HD/ 
SQ; 295-1118 aa). Attempts at purifying the SQ domain 
alone (Upfl-SQ; 915-1118 aa) were unsuccessful, 
although the domain could be obtained in co-purification 
experiments (described later). Each protein was fused to a 
CBP at the N-terminus and to a hexahistidine tag at the 
C-terminus and was purified by successive affinity steps 
with nickel and calmodulin resin columns (33,41). 

The helicase activity of Upf 1 is inhibited by the SQ 
domain 

Previous observations showed that Upfl directionally 
unwinds double-stranded nucleic acids (21,28,29). To 
determine whether the SQ domain affects Upfl activity, 
we first monitored the capacity of the enzyme to unwind 
an RNA-DNA duplex using a gel shift assay (42). The 
duplex substrate was formed by annealing a 33-mer RNA 
strand with a 32 P-labelled DNA oligonucleotide and 
contained a 21 -bp RNA-DNA hybrid helix 
[AG° 37 = -31.6 kcal mol -1 , (43)] downstream of a 
single-stranded RNA (ssRNA) 'tail' of 12 nt (Figure IB). 
According to the size of the RNA-binding site deduced 
from crystal structures of Upfl-HD [9-11 nt; (30)], the 
length of the ssRNA tail is sufficient to accommodate 
only one Upfl molecule upstream from the reporter 
RNA-DNA helix and short enough to prevent coopera- 
tive effects between multiply-bound helicase molecules 
(44,45). The RNA-DNA hybrid substrate was 
pre-incubated with an excess of Upfl protein to favour 
the formation of an enzyme-substrate complex and to 
monitor the unwinding reaction under a pre-steady state 
regimen (35). The unwinding reaction was initiated by 
adding a mixture containing ATP, MgCl 2 and an excess 
of unlabelled DNA strand to 'trap' the RNA strands 
released over time [see 'Materials and Methods section 
and (33)]. The initiation mix also contained heparin in 
amounts sufficient to trap free Upfl molecules 
(i.e. RNA-unbound) so that only the first round of un- 
winding was monitored during the reaction (single-run 
conditions). Unwinding efficiencies were deduced from 
the proportions of 32 P-labelled single-stranded DNA 
accumulated over time. 

Since the CH domain exerts an inhibitory effect on the 
adjacent HD (29,30), we first tested whether the SQ 
domain has similar effect. We monitored the activities of 
the Upfl-HD and Upfl-HD/SQ proteins. Remarkably, 
the unwinding activity of Upfl-HD is strongly affected 
by the presence of the SQ domain (Figure 1C). Indeed, 
the amplitude of the reaction with Upfl-HD/SQ is 
reduced by ~6-fold when compared with that obtained 
with Upfl-HD (Figure ID). The rate of the single-run 
unwinding reaction, k ohs , is also affected by the presence 
of the SQ domain but to a lesser extent (~2-fold; see table 
in Figure ID). Moreover, when ATP is replaced by ADP 
in the reaction, the fraction of duplex unwound over time 
is negligible and comparable for both the Upfl-HD and 
Upfl-HD/SQ proteins (Figure 1C and data not shown). 
These data clearly demonstrate that the SQ domain 
inhibits the helicase activity of Upfl . 



We next tested whether the inhibitory effect of the SQ 
domain is also exerted in the context of the full-length 
protein (Upfl-FL). Upf2 interacts with the CH domain 
relieving its inhibitory effect on helicase activity (30). 
Thus, to discount a potential interference of the CH 
domain, we first verified that its binding activator Upf2 
is able to interact with Upfl-FL. To do this, we used a 
co-precipitation assay (29,33,34) wherein CBP-tagged 
Upf2 (761-1227) was used as bait and the protein 
complexes were purified on calmodulin affinity-resin 
before analysis by SDS-PAGE (Supplementary Figure 
S2A). As expected, Upfl-HD did not co-precipitate with 
Upf2 (lane 4), which confirms that the CH domain is 
required for the interaction (29). In contrast, Upfl-FL 
produced in E. coli (Upfl-FLc) as well as a 
phosphorylated form produced in baculovirus 
(Upfl-FLb; kindly provided by Elena Conti) 
co-precipitated with Upf2 (Supplementary Figure S2A, 
lanes 5 and 6). Controls performed without Upf2 rule 
out that co-precipitation could stem from non-specific 
binding of the Upfl-FL proteins to the beads 
(Supplementary Figure S2A, lanes 1-3). Thus, Upf2 is 
able to interact specifically with Upfl-FL, which 
supports that the CH domain remains fully accessible 
and amenable to Upf2 regulation in the context of 
full-length protein. Based on these findings, the unwinding 
activity of Upfl-FL was monitored in the presence of a 
molar excess of Upf2 to relieve the inhibitory effect of the 
CH domain (30). We observed that Upf2 is not sufficient 
to activate Upfl-FL (Supplementary Figure S2B), 
whereas under identical conditions, Upf2 activates 
Upfl-CH/HD (30). These results show that the repression 
exerted by the SQ domain cannot be relieved by the 
presence of Upf2 bound to the N-terminal CH domain. 
Taken together, these data strongly suggest that the 
helicase core domain of Upfl is independently repressed 
by both the CH and the SQ flanking domains. 

The SQ region physically interacts with the HD 

In order to dissect the mechanism(s) used by the SQ 
domain to inhibit Upfl, we attempted to prepare the 
domain as a separate polypeptide through expression in 
E. coli cells. However, the isolated SQ domain, most likely 
unstructured, cannot be satisfactorily solubilized under 
native conditions (data not shown). We thus tempted to 
co-purify the SQ and Upfl-HD proteins, each one indi- 
vidually expressed from independent cellular batches and 
then co-purified (see 'Materials and Methods' section). We 
made the hypothesis that, if the two domains interact, the 
HD may help to solubilize the SQ domain. In this particu- 
lar experiment, both Upfl-HD and Upfl-SQ contained a 
C-terminal hexahistidine tag while only Upfl-SQ was also 
fused to a N-terminal CBP tag. Following co-lysis of the 
cell pellets (see 'Materials and Methods' section), sequen- 
tial purification on Nickel and calmodulin resins yielded a 
bimolecular complex (Supplementary Figure S3 A, lane 8). 
This bimolecular complex was obtained after RNase 
A treatment and extensive washes under stringent condi- 
tions (0.5 M NaCl) during calmodulin affinity purification, 
which shows that the SQ domain makes strong and 
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Kaleidagraph (Synergy software) to the pseudo-first order equation y = A[l 



'], where A and k represent, respectively, the amplitude and 



the rate constant of the unwinding reaction (35). The values for the amplitudes (^4) and rate constants (k) are tabulated in the panel inset. 



specific contacts with the HD in a RNA-independent 
manner. We also tested the co-expression of CBP-Upfl- 
HD and Upfl-SQ-His from the same plasmid but this 
method yielded too little material for enzymatic assays 
(Figure 2A). To confirm the pertinence of our purification 



procedure, we attempted to co-purify the SQ domain with 
the non-cognate RNA helicase eIF4AIII. This enzyme is 
an archetypal member of the DEAD-box family (SF2 
helicases) and contains the two conserved RecA-like 
domains characteristic of eukaryotic RNA helicases (4). 
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(A) A SDS-PAGE gel illustrating the co-expression and co-purification 
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However, we were unable to co-purify eIF4AIII with 
CBP-Upfl-SQ-His (Supplementary Figure S3B). This ob- 
servation supports that only strong and specific complexes 
such as the one formed between Upfl-SQ and Upfl-HD 
sustain our harsh purification conditions. 

The stability of the complex formed between the 
helicase and SQ domains was further tested using RNA 
pull-down experiments. The preformed complex was 
incubated with a 30-mer biotinylated ssRNA and 
streptavidin beads. Then, the beads were washed with a 
solution of defined ionic strength (0.2-0.5 M potassium 
acetate) before analysing the co-precipitated proteins. As 
shown in Figure 2B, both Upfl-SQ and Upfl-HD 
proteins were precipitated with the beads in the presence 



of the biotinylated RNA (lane 2) but not in its absence 
(lane 3). Moreover, increasing the concentration of potas- 
sium acetate from 0.2 to 0.5 M did not markedly affect the 
amounts of Upfl-SQ and Upfl-HD proteins that were 
recovered with the beads (lanes 4-6). These results 
suggest that neither RNA nor the SQ domain dramatically 
alter each other's capacity to interact with the Upfl 
helicase core in vitro. They also suggest that the inhibitory 
effect of the SQ domain cannot be ascribed to a dramatic 
weakening of the Upfl -substrate interaction. Since the SQ 
domain cannot be tested alone, we, however, cannot rule 
out an alternative scenario whereby both the HD and 
SQ domains would interact with RNA without contacting 
one another. 

We next determined if the SQ domain could inhibit 
Upfl activity when provided in trans by measuring the 
unwinding activity of the preformed Upfl-HD-Upfl-SQ 
complex. We observed that the inhibition of Upfl activity 
was even stronger for the bimolecular Upfl-HD-Upfl-SQ 
complex than for the Upfl-HD/SQ protein (Figure 1C 
and D; compare middle and right panels and correspond- 
ing graphs), supporting that the SQ domain modulates 
helicase activity by directly interacting with the enzyme 
core. The difference in inhibitory extents between 
Upfl-HD/SQ and the binary complex may be due to a 
greater conformational heterogeneity of the Upfl-HD/ 
SQ protein in which the SQ domain may not always 
interact adequately with the HD. In contrast, entropic 
constraints imposed by a physical link between both 
domains do not exist for the binary Upfl-HD-Upfl-SQ 
complex wherein the SQ domain must strongly and spe- 
cifically interact with the helicase core. 

The region of the SQ domain containing phosphorylation 
sites is not required for the inhibition of Upfl helicase 
activity 

Upfl phosphorylation is a critical consequence of PTC 
recognition and EJC interaction during NMD process in 
human (23,25,26). Several phosphor-acceptor residues 
have been identified in vitro, but only phosphorylation at 
Thr 28 in the N-terminus and at Ser 1078 and Ser 1096 in the 
C- terminus have been detected in vivo so far (24-26). 
Despite its functional importance, the role of Upfl phos- 
phorylation remains unclear at the molecular level. One 
possibility is that phosphorylated Ser 1078 and Ser 1096 
residues modulate the regulatory function of the SQ 
domain by affecting directly (or not) the binding of the 
SQ domain to the HD. It has been shown that the replace- 
ment of serine by a negatively charged residue such as 
glutamate could possibly mimic phosphorylation (46,47). 
To test the potential role of the Ser 1078 and Ser 1096 residues 
in the regulation of Upfl activity, we have thus prepared a 
phosphomimetic Upfl-HD/SQ variant that contains two 
glutamate substitutions at positions 1078 and 1096. 
However, these substitutions had no stimulatory effect 
on the unwinding activity of Upfl (data not shown), 
suggesting that the phosphorylation state of the Ser 1078 
and Ser 1096 residues do not directly interfere with SQ 
domain action. 
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In order to test a genuinely phosphorylated form of 
Upfl, we also used full-length Upfl-FLb produced in 
Baculovirus. Mass spectrometry analysis confirmed the 
presence of 1-6 phosphorylated residues without precisely 
localizing them [E. Conti (personal communication)]. To 
account for the presence of the CH repressor domain, we 
have monitored the unwinding activity of Upfl-FLb in the 
presence of the bound Upf2 activator (Supplementary 
Figure S2A, lane 6). Again, the Upfl-Upf2 complex 
showed no unwinding activity (Supplementary Figure 
S2B, right-most panel), suggesting that the inhibitory 



effect of the SQ domain is not alleviated by its 
phosphorylation. 

To confirm the above conclusions and determine the 
portion of the SQ region responsible for Upfl inhibition, 
we have generated three C-terminal truncated variants of 
Upfl-HD/SQ. As depicted in Figure 3 A, the correspond- 
ing C-terminal deletions start from Leucine 1072 
(Upfl-HD/SQ L7 0 72 variant; 46 aa deletion), Glycine 1019 
(Upfl-HD/SQc 7 0 79 ; 99 aa deletion) or Glycine 967 
(Upfl-HD/SQc^; 151 aa deletion), all of which remove 
the phosphorylation sites identified in vivo. We have 
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purified each truncated Upfl-HD/SQ variant (Figure 3B) 
in order to probe their respective unwinding activity. We 
have observed that both Upfl-HD/SQ^y^ and 
Upfl-HD/SQ^ytfip are about as ineffective as the 
full-length Upfl-HD/SQ protein, whereas the shortest 
Upfl-HD/SQ G 967 variant was almost as efficient as 
Upfl-HD (Figure 3C and D). These data show that the 
central region of the SQ domain that is located between 
residues Gly 967 and Gly 1019 is important for its inhibitory 
function. They also show that the C-terminal region con- 
taining the major phosphorylation sites (Figure 3 A) does 
not contribute by itself to SQ-mediated inhibition. 
However, the data do not rule out the possibility that 
phosphorylation of the C-terminus is important for recog- 
nition by accessory factor(s) (see 'Discussion' section). 

The SQ domain impedes Upfl ATP hydrolysis 

To gain further insight into the mechanism of Upfl inhib- 
ition by the SQ domain, we have first used a dot blot assay 
to determine if the Upfl-HD and Upfl-HD/SQ proteins 
and the Upfl-HD-Upfl-SQ complex can bind ATP. In 
this assay, the amounts of [oc 32 P]-ATP retained on a nitro- 
cellulose membrane by the bound proteins were compared 
qualitatively. Although differences were observed, neither 
the Upfl-HD and Upfl-HD/SQ proteins nor the 
Upfl-HD-Upfl-SQ complex displayed drastic defects in 
ATP binding (Figure 4A). In contrast, a variant of 
Upfl-CH/HD carrying a detrimental mutation (R865A) 
in the nucleotide-binding site (21,28,38) was completely 
unable to retain ATP (Figure 4A). Thus, the SQ domain 
does not appear to drastically alter the ability of Upfl to 
bind ATP. Similar results were obtained with [y 32 P]-ATP 
(Supplementary Figure S4A), suggesting that the 
Upfl-HD and Upfl-HD/SQ do not efficiently hydrolyze 
the bound ATP [which is expected in the absence of RNA 
cofactor; (29)] or, alternatively, that they do not readily 
release the Pi produced upon hydrolysis. 

Next, we have measured the steady-state ATP hydrolysis 
rates of the Upfl-HD and Upfl-HD/SQ proteins. The 
purified proteins were incubated with large excesses of 
ATP (including [oc 32 P]-ATP) and RNA cofactor (polyU), 
to favour efficient ATPase turnover (Figure 4B). Under 
these conditions, Upfl-HD/SQ protein showed a weak 
ATPase activity compared with Upfl-HD (Figure 4B) in 
agreement with the poor helicase activity of Upfl-HD/SQ 
(Figure ID). To ensure that these defects were due to in- 
hibition of ATP hydrolysis by the SQ domain rather than 
an ATPase turnover defect (e.g. ADP/Pi release defect), we 
have also performed single-cycle ATPase experiments in 
the presence of a large excess of Upfl-RNA complex 
with respect to ATP concentration (100-fold). Reaction 
mixtures contained saturating concentration of polyU (rep- 
resenting ~100 equivalents of 10-nt-long binding sites per 
protein molecule) to offset potential RNA-binding defects 
triggered by the SQ domain (described later). In this case, 
the rate of single-cycle ATP hydrolysis was about five times 
higher for Upfl-HD than for Upfl-HD/SQ 
(Supplementary Figure S4B and C). In absence of RNA 
cofactor, single-cycle ATP hydrolysis was much slower 
(~15 times) and it was independent of the SQ domain 



(Supplementary Figure S4B and C). Altogether, these 
data attest that the SQ domain impedes RNA-stimulated 
ATP hydrolysis even though it does not physically block 
access of the nucleotide to the Upfl active site. 

The SQ domain uncouples Upfl RNA-binding state from 
the presence of ATP 

The affinity of Upfl for RNA varies during the ATP 
hydrolysis cycle. Notably, the presence of the nucleotide 
in the Upfl active site shifts the helicase core into a 
'closed' conformation that has a lower affinity for 
nucleic acids (30). As is the case for other helicases, 
cycling between 'weak' and 'tight' RNA-binding states 
most likely prompts Upfl activity. Although the SQ 
domain does not prevent Upfl from binding RNA 
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(Figure 2B), we wondered whether it could interfere with 
the NTP-dependent interconversion between the 'weak' 
and 'tight' RNA-binding states. Using an electrophoretic 
mobility shift assay, we have thus assessed the effect of the 
SQ domain on the affinity of Upfl for RNA in the presence 
or absence of the non-hydrolysable nucleotide analogue 
ADPNP. We have incubated a 30-mer radiolabeled 
ssRNA with increasing concentrations of purified 
Upfl-HD or Upfl-HD/SQ and with or without ADPNP. 
In each case, a band shift was observed upon gel electro- 
phoresis as a result of Upfl-RNA complex formation 
(Figure 5). Surprisingly, in the absence of ADPNP, the 
SQ domain markedly reduces the affinity of Upfl for 
the ssRNA (compare outermost panels in Figure 5). The 
presence of ADPNP in the incubation mixture has a similar 
weakening effect on Upfl-HD affinity for RNA (Figure 5, 
left panels), confirming previous observations (29,30). In 
contrast, the presence of ADPNP hardly changes the 
affinity of Upfl-HD/SQ for the ssRNA (Figure 5, right 
panels). Similar results were obtained by RNA pull-down 
assays in which Upfl-HD or Upfl-HD/SQ were 
co-precipitated by a 30-mer biotinylated ssRNA in the 
presence/absence of ADPPNP or ADP (Supplementary 
Figure S5). Thus, the SQ domain appears to constrain 
Upfl in a 'weak' RNA-binding state uncoupled from the 
presence of the nucleotide cofactor. 



DISCUSSION 

Helicases are multitasking enzymes. Many of them 
combine the ability to walk on RNA and/or DNA with 
unwinding double-stranded nucleic acids. They can also 
displace proteins blocking their track or remodel nucleo- 
protein particles. Often the helicase function is quiescent 
and has to be activated through extra-domains, 
self-assembly and/or interactions with molecular 
partners (5,6). Specific activities such as translocation, 
protein displacement or duplex unwinding may also be 



engaged selectively and in a timely fashion. Precise regu- 
lation reflects the importance and complexity of the 
physiological tasks implicating helicases, whereby 
remodelling of supramolecular targets has to occur at 
the right moment, in the right context, and in the appro- 
priate cellular compartment. 

The central role of Upfl in NMD represents a striking 
example of a highly regulated RNA helicase. Recent struc- 
tural and biochemical studies have illuminated the mo- 
lecular mechanisms governing the regulation of Upfl 
during NMD. The Upfl ATPase and helicase activities 
are repressed by an intramolecular interaction between 
the N- terminal CH domain and the central helicase core. 
Binding of the Upf2 cofactor to the CH domain disen- 
gages it from its inhibitory position, thereby prompting 
the enzymatic activity of Upfl (29,30,38,39). Although 
the roles of Upf2 and the CH domain in the control 
of the helicase core are now well established, a global 
view of how regulation is orchestrated in the full-length, 
multi-domain Upfl enzyme is still missing. Notably, the 
function of the C-terminal SQ domain (Figure 1A) — 
which, in higher eukaryotes, is highly conserved 
(Supplementary Figure SI) and a target for phosphoryl- 
ation and cofactor interactions has long remained elusive. 

Using a combination of biochemical approaches, we 
assigned a regulatory function to the SQ domain and 
demonstrated that the CH domain is not the only intra- 
molecular component that represses the activity of Upfl . 
Inhibition of Upfl by the SQ domain (Figure 1C and D) is 
mediated by its stable and specific interaction with the 
helicase core of the enzyme (Figure 2 and 
Supplementary Figure S3). The activity of Upfl is no 
longer restored by the binding of Upf2 to the CH 
domain if the SQ domain is present (i.e. in the context 
of the full-length enzyme; Supplementary Figure S2). 
This suggests that an optimal enzymatic activity of Upfl 
is achieved only if the inhibitory actions of both terminal 
domains are relieved. 
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Recently, a fundamental role for ATPase activity of 
Upfl has emerged in vivo. This activity is essential for 
NMD and is required for the disassembly of the NMD 
factors from mRNA during the late steps of the decay 
process (27,48). We observed that the binding of ATP to 
Upfl is not drastically affected by the presence of the SQ 
domain (Figure 4A and Supplementary Figure S4A), 
whereas ATP hydrolysis is strongly inhibited (Figure 4B, 
Supplementary Figure S4B and C). Thus, the SQ domain 
prevents the ability of Upfl to convert the energy stored in 
the ATP fuel into physical work (mechanochemical trans- 
duction) and hence inhibits the helicase activity. Crystal 
structures show that the helicase core of Upfl is made of 
two flexible RecA-like lobes (Al and A2) that create a 
cleft for ATP binding at their interface. The two lobes 
are relatively distant and can move with respect to each 
other in the absence of ATP. The active site adopts a 
geometry suitable for catalysis only when the nucleotide 
is bound (30,38). Moreover, the presence of ATP has an 
allosteric effect on Upfl helicase core, diminishing the 
affinity of the enzyme for RNA (21,28-30,38). The SQ 
region also reduces the affinity of Upfl for RNA but in 
a manner that becomes unresponsive to the presence of a 
nucleotide cofactor (Figure 5 and Supplementary Figure 
S5). These results suggest that the SQ domain constrains 
Upfl in a conformation wherein RNA-binding compo- 
nents are not adequately positioned for interaction with 
RNA and for stimulation of ATP hydrolysis. 
Alternatively, the binding sites for RNA and the SQ 
domain on the Upfl helicase core may overlap to the 
extent that SQ binding prevents the development of a 
fully productive RNA-bound state of Upfl. The ATP- 
binding/hydrolysis properties of the SQ-containing 
protein are reminiscent of the enzymatic phenotypes dis- 
played by the Upfl-HD (Q665A) and Upfl-HD 
(DE636AA) mutants. These detrimental mutations 
disrupt the network of active site contacts to the (3— and 
y-phosphates of the nucleotide cofactor without affecting 
its binding (38). By analogy, we speculate that the inhibi- 
tory HD-SQ interaction affects the (RNA-driven) 
capacity of phosphate and/or catalytic residues to adopt 
position(s) suitable for ATP hydrolysis. 

PTC-containing mRNA degradation and disassembly of 
NMD factors occur only after Upfl ATPase activation 
(27). This suggests that the protein remains inactive until 
both CH and SQ inhibitions are relieved to ensure efficient 
NMD. A key step during NMD is represented by Upfl 
phosphorylation at N- and C-terminal ends (23-26). One 
may envision that C-terminal phosphorylation of Upfl 
restores enzyme activity by eliminating the auto-inhibition 
exerted by the SQ domain. However, phosphorylation of 
the C-terminal domain (or replacement of phosphorylation 
target sites by glutamate residues) is not sufficient by itself 
to relieve the inhibition of Upfl in vitro (Supplementary 
Figure S2B and data not shown). In fact, we observed 
that Upfl inhibition critically depends on a region of the 
SQ domain located upstream the in vivo phosphorylation 
sites (Figure 3). Since this phosphorylation target region is 
not a direct contributor to SQ-mediated inhibition of Upfl, 
we envision two distinct scenarios, whereby the phosphor- 
ylation target region mediates physiological inactivation of 



the SQ domain and derepression of Upfl . The first scenario 
relies solely on the binding of the SMG1 kinase to Upfl 
that follows interaction of the SURF complex with 
Upf2-Upf3-EJC (49). The second scenario invokes a 
subsequent step whereby additional activating factor(s) 
specifically recognize and bind phosphorylated Upfl. In 
this context, it is tempting to speculate that the 
hyperphosphorylated SQ domain is bound by the SMG5- 
SMG7 heterodimer, which itself recruits protein phosphat- 
ase 2A (PP2A), which in turn catalyses Upfl 
dephosphorylation (23,24,26,50,51). Another candidate 
worth considering is the proline-rich nuclear receptor 
coregulatory protein 2 (PNRC2) that was also found to 
interact with hyperphosphorylated Upfl (52). 
Interestingly, PNRC2 mediates the interaction of 
phosphorylated Upfl with the decapping machinery that 
precedes mRNA decay events (52). 

From all these observations, we propose that during 
assembly of the SURF complex, Upfl is completely 
inactivated by two distinct intramolecular repressor inter- 
actions operated respectively by the CH and SQ terminal 
domains (Figure 6, top diagram). Activation of Upfl is 
initiated during formation of the DECID complex 
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Figure 6. A model of the activation of human Upfl during NMD. 
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whereupon Upf2 binding alleviates the repression exerted 
by the CH domain (Figure 6, middle diagram). In the re- 
sulting intermediate state, Upfl remains catalytically in- 
competent due to persisting SQ-mediated repression. 
Complete activation of Upfl may then occur only when 
the bound SMG1 kinase triggers phosphorylation of Upfl 
or when additional cofactors, such as the SMG5-SMG7 
heterodimer and/or PNRC2, are recruited on the 
phosphorylated SQ domain (Figure 6, bottom diagram). 
Analogous to the mechanism of Upf2 derepression, the 
binding of cofactors to the SQ domain would disrupt 
SQ inhibitory contacts to the helicase core and achieve 
Upfl activation (Figure 6). This sophisticated regulation 
mechanism cannot occur in lower eukaryotes such as 
Saccharomyces cerevisiae wherein Upfl proteins do not 
contain SQ domains (40). SMG orthologues responsible 
of Upfl phosphorylation states are also absent in yeasts 
(40), suggesting that NMD has evolved towards higher 
complexity and greater stringency of regulation from 
lower eukaryotes to metazoans. 

SUPPLEMENTARY DATA 

Supplementary Data are available at NAR Online: 
Supplementary Figures 1-5. 
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